Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives

نویسندگان

  • Minija Tamosiunaite
  • Bojan Nemec
  • Ales Ude
  • Florentin Wörgötter
چکیده

When describing robot motion with dynamic movement primitives (DMPs), goal (trajectory endpoint), shape and temporal scaling parameters are used. In reinforcement learning with DMPs, usually goals and temporal scaling parameters are predefined and only the weights for shaping a DMP are learned. Many tasks, however, existwhere the best goal position is not a priori known, requiring to learn it. Thus, herewe specifically address the question of how to simultaneously combine goal and shape parameter learning. This is a difficult problem because learning of both parameters could easily interfere in a destructive way. We apply value function approximation techniques for goal learning and direct policy search methods for shape learning. Specifically, we use ‘‘policy improvement with path integrals’’ and ‘‘natural actor critic’’ for the policy search. We solve a learning-to-pour-liquid task in simulations as well as using a Pa10 robot arm. Results for learning from scratch, learning initialized by human demonstration, as well as for modifying the tool for the learned DMPs are presented. We observe that the combination of goal and shape learning is stable and robust within large parameter regimes. Learning converges quickly even in the presence of disturbances, which makes this combined method suitable for robotic applications. © 2011 Elsevier B.V. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Movement generation by learning from demonstration and generalization to new targets

We provide a general approach for learning robotic movements from human demonstration. To represent a recorded movement, a non-linear differential equation is adapted such that it reproduces this movement. Based on this representation, we build a library of movements by labeling each recorded movement according to task and context (e.g., grasping, placing, and releasing). Our differential equat...

متن کامل

Skill Learning and Inference Framework

We propose a skill learning and inference framework, which includes five processing modules as follows: 1) human demonstration process, 2) autonomous segmentation process, 3) process of learning dynamic movement primitives, 4) process of learning Bayesian networks, 5) process of constructing motivation graph and inferring skills. Based on the framework, the robot learns and infers situation-ade...

متن کامل

Motor Schemas in Robot Learning

Motor schemas used for robot learning are sequences of action that accomplish a goal-directed behavior, or a task. Motor schemas in robot learning are also known as movement primitives, basis behaviors, units of action, and macro actions. Rather than representing the simplest elementary actions available to the robot, such as a simple command to a robot actuator, schemas and motion primitives r...

متن کامل

Realtime Planning for High-DOF Deformable Bodies using Two-Stage Learning

We present a method for planning the motion of arbitrarily-shaped volumetric deformable bodies or robots through complex environments. Such robots have very highdimensional configuration spaces and we compute trajectories that satisfy the dynamics constraints using a two-stage learning method. First, we train a multitask controller parameterized using dynamic movement primitives (DMP), which en...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Robotics and Autonomous Systems

دوره 59  شماره 

صفحات  -

تاریخ انتشار 2011